ATGAAATTTA GTAAAAAATA TATAGCAGCT GGATCAGCTG TTATCGTATC CTTGAGTCTA
TGTGCCTATG CACTAAACCA GCATCGTTCG CAGGAAAATA AGGACAATAA TCGTGTCTCT 
TATGTGGATG GCAGCCAGTC AAGTCAGAAA AGTGAAAACT TGACACCAGA CCAGGTTAGC 
CAGAAAGAAG GAATTCAGGC TGAGCAAATT GTAATCAAAA TTACAGATCA GGCCTATGTA 
ACGTCACACG GTGACCACTA TCATTACTAT AATGGGAAAG TTCCTTATGA TGCCCTCTTT 
AGTGAAGAAC TCTTGATGAA GGATCCAAAC TATCAACTTA AAGACGCTGA TATTGTCAAT 
GAAGTCAAGG GTGGTTATAT CATCAAGGTC GATGGAAAAT ATTATGTCTA CCTGAAAGAT 
GCAGCTCATG CTGATAATGT TCGAACTAAA GATGAAATCA ATCGTCAAAA ACAAGAACAT 
CTCAAAGATA ATGAGAAGGT TAACTCTAAT GTTCCTGTAG CAAGGTCTCA GGGACGATAT 
ACGACAAATG ATGGTTATGT CTTTAATCCA GCTGATATTA TCGAAGATAC GGGTAATGCT 
TATATCGTTC CTCATGGAGG TCACTATCAC TACATTCCCA AAAGCGATTT ATCTGCTAGT 
GAATTAGCAG CAGCTAAAGC ACATCTGGCT GGAAAAAATA TGCAACCGAG TCAGTTAAGC
TATTCTTCAA CAGCTAGTGA CAATAACACG CAATCTGTAG CAAAAGGATC AACTAGCAAG 
CCAGCAAATA AATCTGAAAA TCTCCAGAGT CTTTTGAAGG AACTCTATGA TTCACCTAGC 
GCCCAACGTT ACAGTGAATC AGATGGCCTG GTCTTTGACC CTGCTAAGAT TATCAGTCGT 
ACACCAAATG GAGTTGCGAT TCCGCATGGC GACCATTACC ACTTTATTCC TTACAGCAAG 
CTTTCTGCTT TAGAAGAAAA GATTGCCAGA ATGGTGCCTA TCAGTGGAAC TGGTTCTACA 
GTTTCTACAA ATGCAAAACC TAATGAAGTA GTGTCTAGTC TAGGCAGTCT TTCAAGCAAT 
CCTTCTTCTT TAACGACAAG TAAGGAGCTC TCTTCAGCAT CTGATGGTTA TATTTTTAAT 
CCAAAAGATA TCGTTGAAGA AACGGCTACA GCTTATATTG TAAGACATGG TGATCATTTC 
CATTACATTC CAAAATCAAA TCAAATTGGG CAACCGACTC TTCCAAACAA TAGTCTAGCA 
ACACCTTCTC CATCTCTTCC AATCAATCCA GGAACTTCAC ATGAGAAACA TGAAGAAGAT 
GGATACGGAT TTGATGCTAA TCGTATTATC GCTGAAGATG AATCAGGTTT TGTCATGAGT 
CACGGAGACC ACAATCATTA TTTCTTCAAG AAGGACTTGA CAGAAGAGCA AATTAAGGCT 
GCGCAAAAAC ATTTAGAGGA AGTTAAAACT AGTCATAATG GATTAGATTC TTTGTCATCT 
CATGAACAGG ATTATCCAGG TAATGCCAAA GAAATGAAAG ATTTAGATAA AAAAATCGAA 
GAAAAAATTG CTGGCATTAT GAAACAATAT GGTGTCAAAC GTGAAAGTAT TGTCGTGAAT 
AAAGAAAAAA ATGCGATTAT TTATCCGCAT GGAGATCACC ATCATGCAGA TCCGATTGAT 
GAACATAAAC CGGTTGGAAT TGGTCATTCT CACAGTAACT ATGAACTGTT TAAACCCGAA
GAAGGAGTTG CTAAAAAAGA AGGGAATAAA GTTTATACTG GAGAAGAATT AACGAATGTT 
GTTAATTTGT TAAAAAATAG TACGTTTAAT AATCAAAACT TTACTCTAGC CAATGGTCAA 
AAACGCGTTT CTTTTAGTTT TCCGCCTGAA TTGGAGAAAA AATTAGGTAT CAATATGCTA 
GTAAAATTAA TAACACCAGA TGGAAAAGTA TTGGAGAAAG TATCTGGTAA AGTATTTGGA 
GAAGGAGTAG GGAATATTGC AAACTTTGAA TTAGATCAAC CTTATTTACC AGGACAAACA 
TTTAAGTATA CTATCGCTTC AAAAGATTAT CCAGAAGTAA GTTATGATGG TACATTTACA 
GTTCCAACCT CTTTAGCTTA CAAAATGGCC AGTCAAACGA TTTTCTATCC TTTCCATGCA 
GGGGATACTT ATTTAAGAGT GAACCCTCAA TTTGCAGTGC CTAAAGGAAC TGATGCTTTA 
GTCAGAGTGT TTGATGAATT TCATGGAAAT GCTTATTTAG AAAATAACTA TAAAGTTGGT 
GAAATCAAAT TACCGATTCC GAAATTAAAC CAAGGAACAA CCAGAACGGC CGGAAATAAA 
ATTCCTGTAA CCTTCATGGC AAATGCTTAT TTGGACAATC AATCGACTTA TATTGTGGAA
GTACCTATCT TGGAAAAAGA AAATCAAACT GATAAACCAA GTATTCTACC ACAATTTAAA 
AGGAATAAAG CACAAGAAAA CTCAAAACTT GATGAAAAGG TAGAAGAACC AAAGACTAGT 
GAGAAGGTAG AAAAAGAAAA ACTTTCTGAA ACTGGGAATA GTACTAGTAA TTCAACGTTA 
GAAGAAGTTC CTACAGTGGA TCCTGTACAA GAAAAAGTAG CAAAATTTGC TGAAAGTTAT 
GGGATGAAGC TAGAAAATGT CTTGTTTAAT ATGGACGGAA CAATTGAATT ATATTTACCA






TCAGGAGAAG TCATTAAAAA GAATATGGCA GATTTTACAG GAGAAGCACC TCAAGGAAAT 
CGTGAAAATA AACCATCTGA AAATGGAAAA GTATCTACTG GAACAGTTGA GAACCAACCA 
ACAGAAAATA AACCAGCAGA TTCTTTACCA GAGGCACCAA ACGAAAAACC TGTAAAACCA 
GAAAACTCAA CGGATAATGG AATGTTGAAT CCAGAAGGGA ATGTGGGGAG TGACCCTATG 
TTAGATCCAG CATTAGAGGA AGCTCCAGCA GTAGATCCTG TACAAGAAAA ATTAGAAAAA 
TTTACAGCTA GTTACGGATT AGGCTTAGAT AGTGTTATAT TCAATATGGA TCGAACGATT 
GAATTAAGAT TGCCAAGTGG AGAAGTGATA AAAAAGAATT TATCTGATTT CATAGCGTAA 
(SEQ ID NO: 1) 
FIGURE 1 
AATTCCTTGT CGGGTAACTT CCGACCCGCA CGAAAGGCGT AATGATTTGG GCACTGTCTC 60

AACGAGAGAC TCGGTGAAAT TTTAGTACCT GTGAAGATCC AGGTTACCCG CGACAGGACG 120 

GAAAGACCCC ATGGAGCTTT ACTGCAGTTT GATATTGAGT GTCTGTACCA CATGTACAGG 180 

ATAGGTAGGA GTCTAAGAGA TCGGGACGCC AGTTTCGAAG GAGACGCTGT TGGGATACTA 240

CCCTTGTGTT ATGGCCACTC TAACCCAGAT AGGTGATCCC TATCGGAGAC AGTGTCTGAC 300 

GGGCAGTTTG ACTGGGGCGG TCGCCTCCTA AAAGGTAACG GAGGCGCCCA AAGGTTCCCT 360 

CAGAATGGTT GGAAATCATT CGCAGAGTGT AAAGGTATAA GGGAGCTTGA CTGCGAGAGC 420 

TACAACTCGA GCAGGGACGA AAGTCGGGCT TAGTGATCCG GTGGTTCCGT ATGGAAGGGC 480 

CATCGCTCAA CGGATAAAAG CTACCCTGGG GATAACAGGC TTATCTCCCC CAAGAGTTCA 540

CATCGACGGG GAGGTTTGGC ACCTCGATGT CGGCTCGTCG CATCCTGGGG CTGTAGTCGG 600 

TCCCAAGGGT TGGGCTGTTC GCCCATTAAA QCCGCACGCG AGCTGGGTTC AGAACGTCGT 660 

GAGACAGTTC GGTCCCTATC CGTCGCGGGC GTAGGAAATT TGAGAGGATC TGCTCCTAGT 720 

ACGAGAGGAC CAGAGTGGAC TTACCGCTGG TGTACCAGTT GTCTTGCCAA AGGCATCGCT 780 

GGGTAGCTAT GTAGGGAAGG GATAAACGCT GAAAGCATCT AAGTGTGAAA CCCACCTCAA 840

GATGAGATTT CCCATGATTA TATATCAGTA AGAGCCCTGA GAGATGATCA GGTAGATAGG 900 

TTAGAAGTGG AAGTGTGGCG ACACATGTAG CGGACTAATA CTAATAGCTC GAGGACTTAT 960 

CCAAAGTAAC TGAGAATATG AAAGCGAACG GTTTTCTTAA ATTGAATAGA TATTCAATTT 1020 

TGAGTAGGTA TTACTCAGAG TTAAGTGACC ATAGCCTAGG AGATACACCT GTACCCATGC 1080 

CGAACACAGA AGTTAAGCCC TAGAACGCCG GAAGTAGTTG GGGGTTGCCC CCTGTGAGAT 1140

AGGGAAGTCG CTTAGCTCTA GGGAGTTTAG CTCAGCTGGG AGAGCATCTG CCTTACAAGC 1200 

AGAGGGTCAG CGGTTCGATC CCGTTAACTC CCAAAGGTCC CGTAGTGTAG CGGTTATCAC 1260 

GTCGCCCTGT CACGGCGAAG ATCGCGGGTT CGATTCCCGT CGGGACCGTT TAAGGTAACG 1320 

CAAGTTATTT TAGACTCGTT AGCTCAGTTG GTAGAGCAAT TGACTTTTAA TCAATGGGTC 1380 

ACTGGTTCGA GCCCAGTACG GGTCATATAT GCGGGTTTGG CGGAATTCTA ATCTCTTTGA 1440

AATCATCTTC TCTCACTTTC CAAAACTCTA TTACCTCTTA TTATACCACA TTTCAATCTT 1500 

CAACTTCCCA GTAATATAAG CACCTCTGGC GAAAGAAGTT TCAATGTCCT AAAGTAATAA 1560 

GTGAATCCAA TTCAGGAACT CCAACAACAA AAGAAACATC TGGTGTCACA AGTATTGGAT 1620 

GGCACAGAGT CACGTGGTAG TCTGACCCTA GCAGAAATTT TAAATAGTAA ACTATTTACT 1680 

GGTTAATTAA ATGGTTAAAT AACCGGTTTA GAAAACTATT TAATAAAGTA AAAGAAGTTG 1740

AGAAAAAACT TCATCATTTA TTGAAATGAG GGATTTATGA AATTTAGTAA AAAATATATA 1800 

GCAGCTGGAT CAGCTGTTAT CGTATCCTTG AGTCTATGTG CCTATGCACT AAACCAGCAT 1860

CGTTCGCAGG AAAATAAGGA CAATAATCGT GTCTCTTATG TGGATGGCAG CCAGTCAAGT 1920 

CAGAAAAGTG AAAACTTGAC ACCAGACCAG GTTAGCCAGA AAGAAGGAAT TCAGGCTGAG 1980 

CAAATTGTAA TCAAAATTAC AGATCAGGGC TATGTAACGT CACACGGTGA CCACTATCAT 2040
TACTATAATG GGAAAGTTCC TTATGATGCC CTCTTTAGTG AAGAACTCTT GATGAAGGAT
CCAAACTATC AACTTAAAGA CGCTGATATT GTCAATGAAG TCAAGGGTGG TTATATCATC 
AAGGTCGATG GAAAATATTA TGTCTACCTG AAAGATGCAG CTCATGCTGA TAATGTTCGA 
ACTAAAGATG AAATCAATCG TCAAAAACAA GAACATGTCA AAGATAATGA GAAGGTTAAC 
TCTAATGTTG CTGTAGCAAG GTCTCAGCCA CGATATACGA CAAATGATGG TTATGTCTTT
AATCCAGCTG ATATTATCGA AGATACGGGT AATGCTTATA TCGTTCCTCA TCGAGGTCAC 
TATCACTACA TTCCCAAAAG CGATTTATCT GCTAGTGAAT TAGCAGCAGC TAAAGCACAT 
CTGGCTGGAA AAAATATGCA ACCGAGTCAG TTAACCTATT CTTCAACAGC TAGTGACAAT 
AACACGCAAT CTGTAGCAAA AGGATCAACT AGCAAGCCAG CAAATAAATC TGAAAATCTC 
CAGAGTCTTT TGAAGGAACT CTATGATTCA CCTACCGCCC AACGTTACAG TGAATCAGAT 
CGCCTGGTCT TTGACCCTGC TAAGATTATC AGTCGTACAC CAAATGGAGT TGCGATTCCG 
CATGGCCACC ATTACCACTT TATTCCTTAC AGCAAGCTTT CTGCTTTAGA AGAAAAGATT 
GCCAGAATGG TGCCTATCAG TGGAACTGGT TCTACAGTTT CTACAAATGC AAAACCTAAT 
GAAGTAGTGT CTAGTCTAGC CAGTCTTTCA AGCAATCCTT CTTCTTTAAC GACAAGTAAG 
GAGCTCTCTT CAGCATCTGA TGGTTATATT TTTAATCCAA AAGATATCGT TGAAGAAACG 
GCTACAGCTT ATATTGTAAG ACATGGTGAT CATTTCCATT ACATTCCAAA ATCAAATCAA 
ATTGGGCAAC CGACTCTTCC AAACAATAGT CTAGCAACAC CTTCTCCATC TCTTCCAATC 
AATCCAGGAA CTTCACATGA GAAACATGAA GAAGATGGAT ACGGATTTGA TGCTAATCGT 
ATTATCGCTG AAGATGAATC AGGTTTTGTC ATGAGTCACG GAGACCACAA TCATTATTTC 
TTCAAGAAGG ACTTGACAGA AGAGCAAATT AAGGCTGCGC AAAAACATTT AGAGGAAGTT 
AAAACTAGTC ATAATGGATT AGATTCTTTG TCATCTCATG AACAGGATTA TCCAGGTAAT 
GCCAAAGAAA TGAAAGATTT AGATAAAAAA ATCGAAGAAA AAATTGCTGG CATTATGAAA 
CAATATGGTG TCAAACGTGA AAGTATTGTC GTGAATAAAG AAAAAAATGC GATTATTTAT 
CCGCATGGAG ATCACCATCA TGCAGATCCG ATTGATGAAC ATAAACCGGT TGGAATTGGT 
CATTCTCACA GTAACTATGA ACTGTTTAAA CCCGAAGAAG GAGTTGCTAA AAAAGAAGGG 
AATAAAGTTT ATACTGGAGA ACAATTAACG AATGTTGTTA ATTTGTTAAA AAATAGTACG 
TTTAATAATC AAAACTTTAC TCTAGCCAAT GGTCAAAAAC GCGTTTCTTT TAGTTTTCCG 
CCTGAATTGG ACAAAAAATT AGGTATCAAT ATGCTAGTAA AATTAATAAC ACCAGATGGA 
AAAGTATTGG AGAAAGTATC TGGTAAAGTA TTTGGAGAAG GAGTAGGGAA TATTGCAAAC 
TTTGAATTAG ATCAACCTTA TTTACCAGGA CAAACATTTA AGTATACTAT CGCTTCAAAA 
GATTATCCAG AAGTAAGTTA TGATGGTACA TTTACAGTTC CAACCTCTTT AGCTTACAAA 
ATGGCCAGTC AAACGATTTT CTATCCTTTC CATGCAGGGG ATACTTATTT AAGAGTGAAC 
CCTCAATTTG CAGTGCCTAA AGGAACTGAT GCTTTAGTCA GAGTGTTTGA TGAATTTCAT 
GGAAATGCTT ATTTAGAAAA TAACTATAAA GTTGGTGAAA TCAAATTACC GATTCCGAAA 
TTAAACCAAG GAACAACCAG AACGGCCGCA AATAAAATTC CTGTAACCTT CATGGCAAAT 
GCTTATTTGG ACAATCAATC GACTTATATT GTGGAAGTAC CTATCTTGGA AAAAGAAAAT 
CAAACTGATA AACCAAGTAT TCTACCACAA TTTAAAAGGA ATAAAGCACA AGAAAACTCA 
AAACTTGATG AAAAGGTAGA AGAACCAAAG ACTAGTGAGA AGGTAGAAAA AGAAAAACTT 
TCTGAAACTG GGAATAGTAC TAGTAATTCA ACGTTAGAAG AAGTTCCTAC AGTGGATCCT 
GTACAAGAAA AAGTAGCAAA ATTTGCTGAA AGTTATGGGA TCAAGCTAGA AAATGTCTTG 
TTTAATATGG ACGGAACAAT TGAATTATAT TTACCATCAG GAGAAGTCAT TAAAAAGAAT 
ATGGCAGATT TTACAGGAGA AGCACCTCAA GGAAATCCTG AAAATAAACC ATCTGAAAAT 
GGAAAAGTAT CTACTGGAAC AGTTGAGAAC CAACCAACAG AAAATAAACC AGCAGATTCT 
TTACCAGAGG CACCAAACGA AAAACCTGTA AAACCAGAAA ACTCAACGGA TAATGGAATG 
TTGAATCCAG AAGGGAATGT GGGGAGTGAC CCTATGTTAG ATCCAGCATT AGAGGAAGCT 
CCAGCAGTAG ATCCTGTACA AGAAAAATTA GAAAAATTTA CAGCTAGTTA CGGATTAGGC
TTAGATAGTG TTATATTCAA TATGGATGQA ACGATTGAAT TAAGATTGCC AAGTGGAGAA
GTGATAAAAA AGAATTTATC TGATTTCATA GCGTAAGGAA TAGCACTAGA AAAAGTCTGA
ATCAAAAATG AAGTTCTCTC AAAACTTAGA AATAAAACTC TGACTTTGGG AGAATTTCAT
TTTATTATTA ATATATAAAA TTTCTTGACA TACAACTTAA AAAGAGGTGG AATATTTACT
AGTTAATT (SEQ ID NO: 2)
FIGURE 2
ATGAAAATCA ATAAAAAATA TCTAGCTGGG TCAGTAGCTA CACTTGTTTT AAGTGTCTGT
GCTTATGAAC TAGGTTTGCA TCAAGCTCAA ACTGTAAAAG AAAATAATCC TGTTTCCTAT
ATAGATGGAA AACAAGCGAC GCAAAAAACG GAGAATTTGA CTCCTGATGA GGTTAGCAAG
CGTGAAGGAA TCAACGCCGA ACAAATCGTC ATCAAGATTA CGGATCAAGG TTATGTGACC 
TCTCATGGAG ACCATTATCA TTACTATAAT GGCAAGGTCC CTTATGATGC CATCATCAGT 
GAAGAGCTCC TCATGAAAGA TCCGAATTAT CAGTTGAAGG ATTCAGACAT TGTCAATGAA 
ATCAAGGGTG GTTATGTCAT TAAGGTAAAC GGTAAATACT ATGTTTACCT TAAGGATGCA 
GCTCATGCGG ATAATGTCCG TACAAAAGAA GAAATCAATC GGCAAAAACA AGAACATAGT 
CAGCATCGTC AAGGAGGGAC TTCAGCAAAC GATGGTGCGG TAGCCTTTGC ACGTTCACAG 
GGACGCTACA CCACAGATGA TGGTTATATC TTCAATGCAT CTGATATCAT CGAAGATACG 
GGCGATGCCT ATATCGTTCC TCATGGAGAT CATTACCATT ACATTCCTAA GAATGAGTTA 
TCAGCTAGCC AGTTGGCTGC TGCAGAAGCC TTCCTATCTG GTCGGGAAAA TCTGTCAAAT 
TTAAGAACCT ATCGCCGACA AAATAGCGAT AACACTCCAA GAACAAACTG GGTACCTTCT 
GTAAGCAATC CAGGAACTAC AAATACTAAC ACAAGCAACA ACAGCAACAC TAACAGTCAA 
GCAAGTCAAA GTAATGACAT TGATAGTCTC TTGAAACAGC TCTACAAACT GCCTTTGAGT 
CAACGCCATG TAGAATCTGA TGGCCTTATT TTCGACCCAG CGCAAATCAC AAGTCGAACC
GCCAGAGGTG TAGCTGTCCC TCATGGTAAC CATTACCACT TTATCCCTTA TGAACAAATG 
TCTGAATTGG AAAAACGAAT TGCTCGTATT ATTCCCCTTC GTTATCGTTC AAACCATTGG 
GTACCAGATT CAAGACCAGA AGAACCAAGT CCACAACCGA CTCCAGAACC TAGTCCAAGT 
CCGCAACCTG CACCAAATCC TCAACCAGCT CCAAGCAATC CAATTGATGA GAAATTGGTC 
AAAGAAGCTG TTCGAAAAGT AGGCGATGGT TATGTCTTTG AGGAGAATCG AGTTTCTCGT
TATATCCCAG CCAAGAATCT TTCAGCAGAA ACAGCAGCAG GCATTGATAG CAAACTGGCC 
AAGCAGGAAA GTTTATCTCA TAAGCTAGGA GCTAAGAAAA CTGACCTCCC ATCTAGTGAT 
CGAGAATTTT ACAATAAGGC TTATGACTTA CTAGCAAGAA TTCACCAAGA TTTACTTGAT 
AATAAAGGTC GACAAGTTGA TTTTGAGGCT TTGGATAACC TGTTGGAACG ACTCAAGGAT 
GTCTCAAGTG ATAAAGTCAA GTTAGTGGAT GATATTCTTG CCTTCTTAGC TCCGATTCGT
CATCCAGAAC GTTTAGGAAA ACCAAATGCG CAAATTACCT ACACTGATGA TGAGATTCAA 
GTAGCCAAGT TGGCAGGCAA GTACACAACA GAAGACGGTT ATATCTTTGA TCCTCGTGAT 
ATAACCAGTG ATGAGGGGGA TGCCTATGTA ACTCCACATA TGACCCATAG CCACTGGATT 
AAAAAAGATA GTTTGTCTGA AGCTGAGAGA GCGGCAGCCC AGGCTTATGC TAAAGAGAAA 
GGTTTGACCC CTCCTTCGAC AGACCATCAG GATTCAGGAA ATACTGAGGC AAAAGGAGCA
GAAGCTATCT ACAACCGCGT GAAAGCAGCT AAGAAGGTGC CACTTGATCG TATGCCTTAC 
AATCTTCAAT ATACTGTAGA AGTCAAAAAC GGTAGTTTAA TCATACCTCA TTATGACCAT
TACCATAACA TCAAATTTGA GTGGTTTGAC GAAGGCCTTT ATGAGGCACC TAAGGGGTAT 
ACTCTTGAGG ATCTTTTGGC GACTGTCAAG TACTATGTCG AACATCCAAA CGAACGTCCG 
CATTCAGATA ATGGTTTTGG TAACGCTAGC GACCATGTTC AAAGAAACAA AAATGGTCAA
GCTGATACCA
GAAGAAACCC
GACAACAATA
FIGURE 3
CAGAGATCTT AGTGAATCAA ATATACTTAA GAAAAGAGGA AAGAATGAAA ATCAATAAAA
AATATCTAGC TGGGTCAGTA GCTACACTTG TTTTAAGTGT CTGTGCTTAT GAACTAGGTT 
TGCATCAAGC TCAAACTGTA AAAGAAAATA ATCGTGTTTC CTATATAGAT GGAAAACAAG 
CGACGCAAAA AACGGAGAAT TTGACTCCTG ATGAGGTTAG CAAGCGTGAA GGAATCAACG 
CCGAACAAAT CGTCATCAAG ATTACGGATC AAGGTTATCT GACCTCTCAT GGAGACCATT 
ATCATTACTA TAATGGCAAG GTCCCTTATG ATGCCATCAT CAGTGAAGAG CTCCTCATGA 
AAGATCCGAA TTATCAGTTG AAGGATTCAG ACATTGTCAA TGAAATCAAG GGTGGTTATG 
TCATTAAGGT AAACGGTAAA TACTATGTTT ACCTTAAGGA TGCAGCTCAT GCGGATAATG 
TCCGTACAAA AGAAGAAATC AATCGGCAAA AACAAGAACA TAGTCAGCAT CGTGAAGGAG 
GGACTTCAGC AAACGATGGT GCGGTAGCCT TTGCACGTTC ACAGGGACGC TACACCACAG 
ATGATGGTTA TATCTTCAAT GCATCTGATA TCATCGAAGA TACGGGCGAT GCCTATATCG 
TTCCTCATGG AGATCATTAC CATTACATTC CTAAGAATGA GTTATCACCT AGCGAGTTGG 
CTGCTGCAGA AGCCTTCCTA TCTGGTCGGG AAAATCTGTC AAATTTAAGA ACCTATCGCC 
GACAAAATAG CGATAACACT CCAAGAACAA ACTGGGTACC TTCTGTAAGC AATCCAGGAA 
CTACAAATAC TAACACAAGC AACAACAGCA ACACTAACAG TCAAGCAAGT CAAAGTAATG 
ACATTGATAG TCTCTTGAAA CAGCTCTACA AACTGCCTTT GAGTCAACGC CATGTAGAAT 
CTGATGGCCT TATTTTCGAC CCAGCGCAAA TCACAAGTCG AACCGCCAGA GGTGTACCTG 
TCCCTCATGG TAACCATTAC CACTTTATCC CTTATGAACA AATGTCTGAA TTGGAAAAAC 
GAATTGCTCG TATTATTCCC CTTCGTTATC GTTCAAACCA TTGGGTACCA GATTCAAGAC 
CAGAAGAACC AAGTCCACAA CCGACTCCAG AACCTAGTCC AAGTCCGCAA CCTGCACCAA 
ATCCTCAACC AGCTCCAAGC AATCCAATTG ATGAGAAATT GGTCAAAGAA GCTGTTCGAA 
AAGTAGGCGA TGGTTATGTC TTTGAGGAGA ATGGAGTTTC TCGTTATATC CCAGCCAAGA 
ATCTTTCAGC AGAAACAGCA GCAGGCATTG ATAGCAAACT GGCCAAGCAG GAAAGTTTAT 
CTCATAAGCT AGGAGCTAAG AAAACTGACC TCCCATCTAG TGATCGAGAA TTTTACAATA 
AGGCTTATGA CTTACTAGCA ACAATTCACC AAGATTTACT TGATAATAAA GGTCGACAAG 
TTGATTTTGA GGCTTTGGAT AACCTGTTGG AACGACTCAA GGATGTCTCA AGTGATAAAG 
TCAAGTTAGT GGATGATATT CTTGCCTTCT TAGCTCCGAT TCGTCATCCA GAACGTTTAG 
GAAAACCAAA TGCGCAAATT ACCTACACTG ATGATGAGAT TCAAGTAGCC AAGTTGGCAG 
GCAAGTACAC AACAGAAGAC GGTTATATCT TTGATCCTCG TGATATAACC AGTGATCAGG 
GGGATGCCTA TGTAACTCCA CATATGACCC ATAGCCACTG GATTAAAAAA GATAGTTTGT 
CTCAAGCTGA GAGAGCGGCA GCCCAGGCTT ATGCTAAAGA GAAAGGTTTG ACCCCTCCTT 
CGACAGACCA TCAGGATTCA GGAAATACTG AGGCAAAAGG AGCAGAAGCT ATCTACAACC 
GCGTGAAAGC AGCTAAGAAG GTGCCACTTG ATCGTATGCC TTACAATCTT CAATATACTG 
TAGAAGTCAA AAACGGTAGT TTAATCATAC CTCATTATGA CCATTACCAT AACATCAAAT 
TTGAGTGGTT TGACGAAGGC CTTTATGAGG CACCTAAGGG GTATACTCTT GAGGATCTTT 
TGGCGACTGT CAAGTACTAT GTCGAACATC CAAACGAACG TCCGCATTCA GATAATGGTT
TTGGTAACGC TAGCGACCAT GTTCAAACAA ACAAAAATGG TCAAGCTGAT ACCAATCAAA 
CGCAAAAACC AAGCGAGGAG AAACCTCAGA CAGAAAAACC TGAGGAAGAA ACCCCTCGAG 
AAGAGAAACC ACAAAGCGAG AAACCAGAGT CTCCAAAACC AACAGAGGAA CCAGAACAAG 
AATCACCAGA GGAATCAGAA GAACCTCAGG TCGAGACTGA AAAGGTTGAA GAAAAACTGA
GAGAGGCTGA AGATTTACTT GGAAAAATCC AGGATCCAAT TATCAAGTCC AATGCCAAAG 
AGACTCTCAC AGGATTAAAA AATAATTTAC TATTTGGCAC CCAGGACAAC AATACTATTA 
TGGCAGAAGC TGAAAAACTA TTGGCTTTAT TAAAGGAGAG TAACTAAAGG TAGCAGCATT 2580 
TTCTAACTCC TAAAAACAGG ATAGGAGAAC GGGAAAACGA AAAATGAGAG CAGAATGTGA 2640 
GTTCTAG (SEQ ID NO: 4)
FIGURE 4 
GGGTCTTAAA ACTCTGAATC CTTTAGAGGC AGACCCACAA AATGACAAGA CCTATTTAGA
AAATCTGGAA GAAAATATGA GTGTTCTAGC AGAAGAATTA AAGTGACCAA AGAATGAAAA 
TCAATAAAAA ATATCTAGCA GGTTCAGTGG CAGTCCTTGC CCTAAGTGTT TGTTCCTATG 
AACTTGGTCG TCACCAAGCT GGTCAGGTTA AGAAAGAGTC TAATCGAGTT TCTTATATAG 
ATGGTGATCA GGCTGGTCAA AAGGCAGAAA ATTTGACACC AGATGAAGTC AGTAAGAGAG 
AGGGGATCAA CGCCGAACAA ATTGTTATCA AGATTACGGA TCAAGGTTAT GTGACCTCTC 
ATGGAGACCA TTATCATTAC TATAATGGCA AGGTTCCTTA TGATGCCATC ATCAGTGAAG 
AACTTCTCAT GAAAGATCCG AATTATCAGT TGAAGGATTC AGACATTGTC AATGAAATCA 
AGGGTGGCTA TGTGATTAAG GTAGACGGAA AATACTATGT TTACCTTAAA GATGCGGCCC 
ATGCGGACAA TATTCGGACA AAAGAAGAGA TTAAACGTCA GAAGCAGGAA CACAGTCATA 
ATCATAACTC AAGAGCAGAT AATGCTGTTG CTGCAGCCAG AGCCCAAGGA CGTTATACAA 
CGGATGATGG GTATATCTTC AATGCATCTG ATATCATTGA GGACACGGGT GATGCTTATA 
TCGTTCCTCA CGGCGACCAT TACCATTACA TTCCTAAGAA TGAGTTATCA GCTAGCGAGT 
TAGCTGCTGC AGAAGCCTAT TGGAATGGGA AGCAGGGATC TCGTCCTTCT TCAAGTTCTA 
GTTATAATGC AAATCCAGTT CAACCAAGAT TGTCAGAGAA CCACAATCTG ACTGTCACTC
CAACTTATCA TCAAAATCAA GGGGAAAACA TTTCAAGCCT TTTACGTGAA TTGTATGCTA 
AACCCTTATC AGAACGCCAT GTAGAATCTG ATGGCCTTAT TTTCGACCCA GCGCAAATCA 
CAAGTCGAAC CGCCAGAGGT GTAGCTGTCC CTCATGGTAA CCATTACCAC TTTATCCCTT 
ATGAACAAAT GTCTGAATTG GAAAAACGAA TTGCTCGTAT TATTCCCCTT CGTTATCGTT 
CAAACCATTG GGTACCAGAT TCAAGACCAG AACAACCAAG TCCACAATCG ACTCCGGAAC 
CTAGTCCAAG TCTGCAACCT GCACCAAATC CTCAACCAGC TCCAAGCAAT CCAATTGATG 
AGAAATTGGT CAAAGAAGCT GTTCGAAAAG TAGGCGATGG TTATGTCTTT GAGGAGAATG 
GAGTTTCTCG TTATATCCCA GCCAAGGATC TTTCAGCAGA AACAGCAGCA GGCATTGATA 
GCAAACTGGC CAAGCAGGAA AGTTTATCTC ATAAGCTAGG AGCTAAGAAA ACTGACCTCC 
CATCTAGTGA TCGAGAATTT TACAATAAGG CTTATGACTT ACTAGCAAGA ATTCACCAAG 
ATTTACTTGA TAATAAAGGT CGACAAGTTG ATTTTGAGGT TTTGGATAAC CTGTTGGAAC 
GACTCAAGGA TGTCTCAAGT GATAAAGTCA AGTTAGTGGA TGATATTCTT GCCTTCTTAG 
CTCCGATTCG TCATCCAGAA CGTTTAGGAA AACCAAATGC GCAAATTACC TACACTGATG 
ATGAGATTCA AGTAGCCAAG TTGGCAGGCA AGTACACAAC AGAAGACGGT TATATCTTTG 
ATCCTCCTGA TATAACCAGT GATGAGGGGG ATGCCTATGT AACTCCACAT ATGACCCATA 
GCCACTGGAT TAAAAAAGAT AGTTTGTCTG AAGCTGAGAG AGCGGCAGCC CAGGCTTATG 
CTAAAGAGAA AGGTTTGACC CCTCCTTCGA CAGACCACCA GGATTCAGGA AATACTGAGG 



CAAAAGGAGC AGAAGCTATC TACAACCGCG TGAAAGCAGC TAACAAGCTC CCACTTGATC
GTATGCCTTA CAATCTTCAA TATACTGTAG AAGTCAAAAA CGGTAGTTTA ATCATACCTC 
ATTATGACCA TTACCATAAC ATCAAATTTG AGTGGTTTGA CGAAGGCCTT 
CTAACGGGTA TAGTCTTGAG GATCTTTTGG CGACTGTCAA GTACTATGTC GAACATCCAA 
ACGAACGTCC GCATTCAGAT AATGGTTTTG GTAACGCTAG TGACCATGTT CGTAAAAATA 
AGGCAGACCA AGATAGTAAA CCTGATGAAG ATAAGGAACA TGATGAAGTA AGTGAGCCAA 
CTCACCCTGA ATCTGATGAA AAACAGAATC ACGCTGGTTT AAATCCTTCA GCAGATAATC 
TTTATAAACC AAGCACTGAT ACGGAAGAGA CAGACGAAGA AGCTGAAGAT ACCACAGATG




— nw-iBiMuan 1 ACCACAGATG 2 4 On 

10 TCCTCAAGTA GAGAATTCTQ ™TTAACGC TAAGATAGCA GATGCGGAGG 2460 

IZ^l^ ™^ ™!^ TAGTA ^CTATGGAG ACATTGACTG 2320 


GTCTAAAAAG TAGTCTTCTT CTCGGAACGA AAGATAATAA CACTATTTCA GCAGAAGTAG 
ATAGTCTCTT GGCTTTGTTA AAACAAAGTC AACCCGCTCC TATACAGTAG TAAAATGAA
(SEQ ID NO : 5) 
FIGURE 5 



MKFSKKYIAA GSAVIVSLSL CAYALNQHRS QENKDNNRVS YVDGSQSSQK 50
SENLTPDQVS QKEGIQAEQI VIKITDQGYV TSHGDHYHYY NGKVPYDALF 100
SEELLMKDPN YQLKDADIVN EVKGGYIIKV DGKYYVYLKD AAHADNVRTK 150
DEINRQKQEH VKDNEKVNSN VAVARSQGRY TTNDGYVFNP ADIIEDTGNA 200
YIVPHGGHYH YIPKSDLSAS ELAAAKAHLA GKNMQPSQLS YSSTASDNNT 250
QSVAKGSTSK PANKSENLQS LLKELYDSPS AQRYSESDGL VFDPAKIISR 300
TPNGVAIPHG DHYHFIPYSK LSALEEKIAR MVPISGTGST VSTNAKPNEV 350
VSSLGSLSSN PSSLTTSKEL SSASDGYIFN PKDIVEETAT AYIVRHGDHF 400
HYIPKSNQIG QPTLPNNSLA TPSPSLPINP GTSHEKHEED GYGFDANRII 450
AEDESGFVMS HGDHNHYFFK KDLTEEQIKA AQKHLEEVKT SHNGLDSLSS 500
HEQDYPGNAK EMKDLDKKIE EKIAGIMKQY GVKRESIVVN KEKNAIIYPH 550
GDHHHADPID EHKPVGIGHS HSNYELFKPE EGVAKKEGNK VYTGEELTNV 600
VNLLKNSTFN NQNFTLANGQ KRVSFSFPPE LEKKLGINML VKLITPDGKV 650
LEKVSGKVFG EGVGNIANFE LDQPYLPGQT FKYTIASKDY PEVSYDGTFT 700
VPTSLAYKMA SQTIFYPFHA GDTYLRVNPQ FAVPKGTDAL VRVFDEFHGN 750
AYLENNYKVG EIKLPIPKLN QGTTRTAGNK IPVTFMANAY LDNQSTYIVE 800
VPILEKENQT DKPSILPQFK RNKAQENSKL DEKVEEPKTS EKVEKEKLSE 850
TGNSTSNSTL EEVPTVDPVQ EKVAKFAESY GMKLENVLFN MDGTIELYLP 900
SGEVIKKNMA DFTGEAPQGN GENKPSENGK VSTGTVENQP TENKPADSLP 950
EAPNEKPVKP ENSTDNGMLN PEGNVGSDPM LDPALEEAPA VDPVQEKLEK 1000
FTASYGLGLD SVIFNMDGTI ELRLPSGEVI KKNLSDFIA (SEQ ID NO: 6) 1039
FIGURE 6 



MKINKKYLAG SVATLVLSVC AYELGLHQAQ TVKENNRVSY IDGKQATQKT
ENLTPDEVSK REGINAEQIV IKITDQGYVT SHGDHYHYYN GKVPYDAIIS
EELLMKDPNY QLKDSDIVNE IKGGYVIKVN GKYYVYLKDA AHADNVRTKE
EINRQKQEHS QHREGGTSAN DGAVAFARSQ GRYTTDDGYI FNASDIIEDT
GDAYIVPHGD HYHYIPKNEL SASELAAAEA FLSGRENLSN LRTYRRQNSD
NTPRTNWVPS VSNPGTTNTN TSNNSNTNSQ ASQSNDIDSL LKQLYKLPLS 300
QRHVESDGLI FDPAQITSRT ARGVAVPHGN HYHFIPYEQM SELEKRIARI 350
IPLRYRSNHW VPDSRPEEPS PQPTPEPSPS PQPAPNPQPA PSNPIDEKLV 400
KEAVRKVGDG YVFEENGVSR YIPAKNLSAE TAAGIDSKLA KQESLSHKLG 450
AKKTDLPSSD REFYNKAYDL LARIHQDLLD NKGRQVDFEA LDNLLERLKD 500
VSSDKVKLVD DILAFLAPIR HPERLGKPNA QITYTDDEIQ VAKLAGKYTT 550
EDGYIFDPRD ITSDEGDAYV TPHMTHSHWI KKDSLSEAER AAAQAYAKEK 600
GLTPPSTDHQ DSGNTEAKGA EAIYNRVKAA KKVPLDRMPY NLQYTVEVKN 650
GSLIIPHYDH YHNIKFEWFD EGLYEAPKGY TLEDLLATVK YYVEHPNERP 700
HSDNGFGNAS DHVQRNKNGQ ADTNQTEKPS EEKPQTEKPE EETPREEKPQ 750
SEKPESPKPT EEPEEESPEE SEEPQVETEK VEEKLREAED LLGKIQDPII 800
KSNAKETLTG LKNNLLFGTQ DNNTIMAEAE KLLALLKESK (SEQ ID NO: 7) 840
FIGURE 7



MKINKKYLAG SVAVLALSVC SYELGRHQAG QVKKESNRVS YIDGDQAGQK 50
AENLTPDEVS KREGINAEQI VIKITDQGYV TSHGDHYHYY NGKVPYDAII 100
SEELLMKDPN YQLKDSDIVN EIKGGYVIKV DGKYYVYLKD AAHADNIRTK 150
EEIKRQKQEH SHNHNSRADN AVAAARAQGR YTTDDGYIFN ASDIIEDTGD 200
AYIVPHGDHY HYIPKNELSA SELAAAEAYW NGKQGSRPSS SSSYNANPVQ 250
PRLSENHNLT VTPTYHQNQG ENISSLLREL YAKPLSERHV ESDGLIFDPA 300
QITSRTARGV AVPHGNHYHF IPYEQMSELE KRIARIIPLR YRSNHWVPDS 350
RPEQPSPQST PEPSPSLQPA PNPQPAPSNP IDEKLVKEAV RKVGDGYVFE 400
ENGVSRYIPA KDLSAETAAG IDSKLAKQES LSHKLGAKKT DLPSSDREFY 450
NKAYDLLARI HQDLLDNKGR QVDFEVLDNL LERLKDVSSD KVKLVDDILA 500
FLAPIRHPER LGKPNAQITY TDDEIQVAKL AGKYTTEDGY IFDPRDITSD 550
EGDAYVTPHM THSHWIKKDS LSEAERAAAQ AYAKEKGLTP PSTDHQDSGN 600
TEAKGAEAIY NRVKAAKKVP LDRMPYNLQY TVEVKNGSLI IPHYDHYHNI 650
KFEWFDEGLY EAPKGYSLED LLATVKYYVE HPNERPHSDN GFGNASDHVR 700
KNKADQDSKP DEDKEHDEVS EPTHPESDEK ENHAGLNPSA DNLYKPSTDT 750
EETEEEAEDT TDEAEIPQVE NSVINAKIAD AEALLEKVTD PSIRQNAMET 800
LTGLKSSLLL GTKDNNTISA EVDSLLALLK QSQPAPIQ 838
(SEQ ID NO: 8)
FIGURE 8

TGTGCCTATG CACTAAACCA GCATCGTTCG CAGGAAAATA AGGACAATAA TCGTGTCTCT 60
TATGTGGATG GCAGCCAGTC AAGTCAGAAA AGTGAAAACT TGACACCAGA CCAGGTTAGC 120
CAGAAAGAAG GAATTCAGGC TGAGCAAATT GTAATCAAAA TTACAGATCA GGGCTATGTA 180
ACGTCACACG GTGATCACTA TCATTACTAT AATGGGAAAG TTCCTTATGA TGCCCTCTTT 240
AGTGAAGAAC TCTTGATGAA GGATCCAAAC TATCAACTTA AAGACGCTGA TATTGTCAAT 300
GAAGTCAAGG GTGGTTATAT CATCAAGGTC GATGGAAAAT ATTATGTCTA CCTGAAAGAT 360
GCAGCTCATG CTGATAATGT TCGAACTAAA GATGAAATCA ATCGTCAAAA ACAAGAACAT 420
GTCAAAGATA ATGAGAAGGT TAACTCTAAT GTTGCTGTAG CAAGGTCTCA GGGACGATAT 480
ACGACAAATG ATGGTTATGT CTTTAATCCA GCTGATATTA TCGAAGATAC GGGTAATGCT 540
TATATCGTTC CTCATGGAGG TCACTATCAC TACATTCCCA AAAGCGATTT ATCTGCTAGT

GAATTAGCAG CAGCTAAAGC ACATCTGGCT GGAAAAAATA TGCAACCGAC TCAGTTAAGC 
TATTCTTCAA CACCTTCTCC ATCTCTTCCA ATCAATCCAG GAACTTCACA TGAGAAACAT 
GAAGAAGATG GATACGGATT TGATGCTAAT CGTATTATCG CTGAAGATGA ATCAGGTTTT 
GTCATGAGTC ACGGAGACCA CAATCATTAT TTCTTCAAGA AGGACTTGAC AGAAGAGCAA 
ATTAAGGCTG CGCAAAAACA TTTAGAGGAA GTTAAAACTA GTCATAATGG ATTAGATTCT 
TTGTCATCTC ATGAACAGGA TTATCCAAGT AATGCCAAAG AAATGAAAGA TTTAGATAAA 
AAAATCGAAG AAAAAATTGC TGGCATTATG AAACAATATG GTGTCAAACG TGAAAGTATT 
GTCGTGAATA AAGAAAAAAA TGCGATTATT TATCCGCATG GAGATCACCA TCATGCAGAT 
CCGATTGATG AACATAAACC GGTTGGAATT GGTCATTCTC ACAGTAACTA TGAACTGTTT 
AAACCCGAAG AAGGAGTTGC TAAAAAAGAA GGGAATAAAG TTTATACTCG AGAAGAATTA 
ACGAATGTTG TTAATTTGTT AAAAAATAGT ACGTTTAATA ATCAAAACTT TACTCTAGCC 
AATGGTCAAA AACGCGTTTC TTTTAGTTTT CCGCCTGAAT TGGAGAAAAA ATTAGGTATC 
AATATGCTAG TAAAATTAAT AACACCAGAT GGAAAAGTAT TCGAGAAAGT ATCTGGTAAA 
GTATTTGGAG AACGAGTAGG GAATATTGCA AACTTTGAAT TAGATCAACC TTATTTACCA 
GGACAAACAT TTAAGTATAC TATCGCTTCA AAAGATTATC CAGAAGTAAG TTATGATGGT 
ACATTTACAG TTCCAACCTC TTTAGCTTAC AAAATGGCCA GTCAAACGAT TTTCTATCCT 
TTCCATGCAG GGGATACTTA TTTAAGAGTG AACCCTCAAT TTGCAGTGCC TAAAGGAACT 
GATGCTTTAG TCAGAGTGTT TGATGAATTT CATGGAAATG CTTATTTAGA AAATAACTAT
AAAGTTGGTG AAATCAAATT ACCGATTCCG AAATTAAACC AAGGAACAAC CAGAACGGCC 
GGAAATAAAA TTCCTGTAAC CTTCATGGCA AATGCTTATT TGGACAATCA ATCGACTTAT 
ATTGTGGAAG TACCTATCTT GGAAAAAGAA AATCAAACTG ATAAACCAAG TATTCTACCA
CAATTTAAAA GGAATAAAGC ACAAGAAAAC TCAAAACTTG ATGAAAAGGT AGAAGAACCA 
AAGACTAGTG AGAAGGTAGA AAAAGAAAAA CTTTCTGAAA CTGGGAATAG TACTAGTAAT 
TCAACGTTAG AAGAAGTTCC TACAGTGGAT CCTGTACAAG AAAAAGTAGC AAAATTTGCT 
GAAAGTTATG GGATGAAGCT AGAAAATGTC TTGTTTAATA TGGACGGAAC AATTGAATTA
TATTTACCAT CGGGAGAAGT CATTAAAAAG AATATGGCAG ATTTTACAGG AGAAGCACCT 
CAAGGAAATG GTGAAAATAA ACCATCTGAA AATGGAAAAG TATCTACTGG AACAGTTGAG 
AACCAACCAA CAGAAAATAA ACCAGCAGAT TCTTTACCAG AGGCACCAAA CGAAAAACCT 
GTAAAACCAG AAAACTCAAC GGATAATGGA ATGTTGAATC CAGAAGGGAA TGTGGGGAGT 
GACCCTATGT TACATTCAGC ATTAGAGGAA GCTCCAGCAG TAGATCCTGT ACAAGAAAAA 
TTAGAAAAAT TTACAGCTAG TTACGGATTA GGCTTAGATA GTGTTATATT CAATATGGAT 
GGAACGATTG AATTAAGATT GCCAAGTGGA GAAGTGATAA AAAAGAATTT ATTGATCTCA 
TAGCGTAA (SEQ ID NO : 9) 
FIGURE 9 



CAYALNQHRS QENKDNNRVS YVDGSQSSQK SENLTPDQVS QKEGIQAEQI 50
VIKITDQGYV TSHGDHYHYY NGKVPYDALF SEELLMKDPN YQLKDADIVN 100
EVKGGYIIKV DGKYYVYLKD AAHADNVRTK DEINRQKQEH VKDNEKVNSN 150
VAVARSQGRY TTNDGYVFNP ADIIEDTGNA YIVPHGGHYH YIPKSDLSAS 200
ELAAAKAHLA GKNMQPSQLS YSSTPSPSLP INPGTSHEKH EEDGYGFDAN 250
RIIAEDESGF VMSHGDHNHY FFKKDLTEEQ IKAAQKHLEE VKTSHNGLDS 300
LSSHEQDYPS NAKEMKDLDK KIEEKIAGIM KQYGVKRESI VVNKEKNAII 350
YPHGDHHHAD PIDEHKPVGI GHSHSNYELF KPEEGVAKKE GNKVYTGEEL 400
TNVVNLLKNS TFNNQNFTLA NGQKRVSFSF PPELEKKLGI NMLVKLITPD 450
GKVLEKVSGK VFGEGVGNIA NFELDQPYLP GQTFKYTIAS KDYPEVSYDG 500
TFTVPTSLAY KMASQTIFYP FHAGDTYLRV NPQFAVPKGT DALVRVFDEF 550
HGNAYLENNY KVGEIKLPIP KLNQGTTRTA GNKIPVTFMA NAYLDNQSTY 600
IVEVPILEKE NQTDKPSILP QFKRNKAQEN SKLDEKVEEP KTSEKVEKEK 650
LSETGNSTSN STLEEVPTVD PVQEKVAKFA ESYGMKLENV LFNMDGTIEL 700
YLPSGEVIKK NMADFTGEAP QGNGENKPSE NGKVSTGTVE NQPTENKPAD 750
SLPEAPNEKP VKPENSTDNG MLNPEGNVGS DPMLDSALEE APAVDPVQEK 800
LEKFTASYGL GLDSVIFNMD GTIELRLPSG EVIKKNLLIS 840
(SEQ ID NO: 10)
FIGURE 10



DQGYVTSHGD HYHYYNGKVP YDALFSEELL MKDPNYQLKD ADIVNEVKGG YIIKVDGKYY 
VYLKDAAHAD NVRTKDEINR QKQEHVKDNE KVNS
(SEQ ID NO: 11) 
FIGURE 11 

GIQAEQIVIK ITDQGYVTSH GDHYHYYNGK VPYDALFSEE LL 
(SEQ ID NO: 12) 
FIGURE 12 

TAYIVRHGDH FHYIPKSNQI GQPTLPNNSL ATPSPSLPI 
(SEQ ID NO: 13) 
FIGURE 13 



TSNSTLEEVP TVDPVQEKVA KFAESYGMKL ENVLFN 
(SEQ ID NO: 14) 
FIGURE 14 



MDGTIELRLP SGEVIKKNLS DFIA 
(SEQ ID NO: 15) 
FIGURE 15 

YGLGLDSVIF NMDGTIELRL PSGEVIKKNL SDFIA
(SEQ ID NO: 16) 
FIGURE 16 

PALEEAPAVD PVQEKLEKFT ASYGLGLDSV IFNMDGTIEL RLPSGEVIKK NLSDFIA
(SEQ ID NO: 17) 
FIGURE 17 
KVEEPKTSEK VEKEKLSETG NSTSNSTLEE VPTVDPVQEK
(SEQ ID NO: 18) 
FIGURE 18 

MKDLDKKIEE KIAGIMKQYG VKRESIVVNK EKNAIIYPHG DHHHADPIDE HKPVGIGHSH
SNYELFKPEE GVAKKEGN
(SEQ ID NO: 19)
FIGURE 19

AIIYPHGDHH HADPIDEHKP VGIGHSHSNY ELFKPEEGVA KKEGNKVYTG E
(SEQ ID NO: 20)
FIGURE 20 

IQVAKLAGKY TTEDGYIFDP RDITSDEGD 
(SEQ ID NO: 21) 
FIGURE 21 

DHQDSGNTEA KGAEAIYNRV KAAKKVPLDR MPYNLQYTVE VKNGSLIIPH YDHYHNIKFE 
WFDEGLYEAP KGYSLEDLLA TVKYYV
(SEQ ID NO: 22)
FIGURE 22 

GLYEAPKGYS LEDLLATVKY YVEHPNERPH SDNGFGNASD H 
(SEQ ID NO: 23) 
FIGURE 23 

GLYEAPKGYSLEDLLATVKYYV
(SEQ ID NO: 163)
FIGURE 24
Epitope Localization on BVH-3 Protein
(Schematic of the BVH-3 protein drawn from NH2 to COOH, 1039 residues. Labelled regions: leader; conserved; 177-a.a. sequence (missing in BVH-3Sp63); hydrophobic tail. Histidine motifs shown: HGDHYH, HGGHYH, HGDHYH, HGDHFH, HGDHNH, HGDHHH. Fragments shown: BVH-3A, BVH-3B, BVH-3C, NEW1, NEW2, NEW3. Positions marked: 472, 509, 512, 800, 1039. Mab labels: H3-4F9, H3-4D4, H3-9H12, H11-6E7, H1M0H10, H3-7G2, H3V-9C2, H3V-4F3, HN1-5H3, HN1-8E3, HN1-14F6, H3V-2F2, H3V-7F4, HN1-2G2, HN1-1208, HN1-14B2, H3-4D3, H3V-7H3, H3V-13B8, HNM0C12, HN1-1G2.)
FIGURE 28
Epitope Localization on BVH-11-2 Protein
(Schematic of the BVH-11-2 protein drawn from NH2 to PAPIQ-COOH, 838 residues. Labelled regions: leader; conserved; proline rich; charged; BVH-11-2-specific. Histidine motifs shown: HGDHYH, HGDHYH, HGNHYH, HMTHSH, HYDHYH. Positions marked: 271, 497, 699, 838. Mab labels: H11-7G11, H112-10D7, H112-3A1, H112-10G9*, H112-10A2*, H11B-11B8*, H112-3E8, H112-3A4.)
* Surface-exposed and protection-conferring Mabs

FIGURE 29
FIGURE 30
BVH-3M 1 CAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 60
BVH3-63 1 CAYALNQHRSQENKDNNRVSYVDGSQSSQKSENLTPDQVSQKEGIQAEQIVIKITDQGYV 60



****** 



BVH-3M 61 TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKD 120
BVH3-63 61 TSHGDHYHYYNGKVPYDALFSEELLMKDPNYQLKDADIVNEVKGGYIIKVDGKYYVYLKD 120



15 



BVH3-63 121 AAHADNVRTKDE INRQKQEHVIOJNEKVNSWAVARSQCRYTTNDG YVFNPADI I EDTGNA 



80 
180 



BVH-3K 181 YIVPHGGHYHYI PKS DLSASELAAAKAHLAGKNMQPSQLS YS STASDNNTQSVAIO 
BVH3-63 181 YIVPHGGHYHYI PKSDLSASEIiAAAKAHLAGKNMQPSQLSYSS 



GSTSK 240 
223 



BVH-3M 241 PANKSENLQSLLKELYDSPSAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSK 300
BVH3-63 224 

223 



BVH-3M 301 LSALEEKIARMVPISGTGSTVSTNAKPNEVVSSLGSLSSNPSSLTTSKELSSASDGYIFN 360
25 BVH3-63 224 

_ 223 



BVH-3M 361 PKDIVEETATAYIVRHGDHFHYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEED 420

BVH3-63 224 . 

3Q " TPSPSLPINPGTSHEKHEED 243 

*■**************,»*** 

BVH-3M 421 GYGFDANRIIAEDESGFVMSHGDHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS 480
BVH3-63 244 GYGFDANRIIAEDESGFVMSHGDHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSS 303

*******************##***A****^^^**i^^- - -._. jl _^ J ^ 



35 

BVH-3M 481 HEQDYPGNAKEMKDLDKKIEEKIAGIMKQYGVKRESIVVNKEKNAIIYPHGDHHHADPID 540
BVH3-63 304 HEQDYPSNAKEMKDLDKKIEEKIAGIMKQYGVKRESIVVNKEKNAIIYPHGDHHHADPID 363

BVH-3M 541 EHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNVVNLLKNSTFNNQNFTLANGQ 600
BVH3-63 364 EHKPVGIGHSHSNYELFKPEEGVAKKEGNKVYTGEELTNVVNLLKNSTFNNQNFTLANGQ 423 



BVH-3M 601 KRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFGEGVGNIANFELDQPYLPGQT 660 
BVH3-63 424 KRVSFSFPPELEKKLGINMLVKLITPDGKVLEKVSGKVFGEGVGNIANFELDQPYLPGQT 483
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BVH3.3 --^^^rrr^ 1 ^^^^- »° 



J^tt I" VPILEKENQTDKPSILPQPKRNKAQENSKLJ5EKVEEPKTSEKVEKEKLSETCNSTSNSTL 840 
BVH3-63 3 ° 

* ** * * * 

5 B^3 3 63 I,.' ^^^^^^ 900 
BVH3-63 664 723 

BVH3-63 724 G^KPSENOKVSTCTVENQPTBNKPADSLPEAPNEKPVKPENSTDNGMLMPECNVGSDPM 783 

BVH3-63 784 LDSALEEAPAVDPVQEKLEKFTASYGLGLDSVIFNKDGTIELRLPSGEVIKKNLLIS 840 
FIGURE 31 



BVH 3 i l^FSKKYlAAGSAVIVSLSIXIAYAI^QHRSQENK-DNNRVSyVDGSQSSQKSENLTPDQV 55 

BVHU-2 1 ^I^^-SVAV^SVCSYEI^RHQAGQVKXES^VSVIDGDQA^KAENLTPDEV 59 



BVH-3 60 SQKEGIQAEQIVIXITDQGYVTSHGDHYHYYNGKVPYD. 

BVH-11-2 60 



BW 11 eo — w^x v I'attUUHXHYYNGiwPYDALFSEELLMKDPNYQLKDADlV 119 

BVH-11 59 SKl^GINAEQrVIKITDQGVVTSHGDHYHYYNGIfVpyDAIISEELLMKDPNYQLKDSDIV 118 

SKREGINAEQIVIKITDQGYVTSHGDHYHYWGKVPYDAIISEELLMKDPNYQLKDSDIV 119 



B^- 1 I! 9 ^ TOVI1 ^ I ^ ra ^^^I N RQXQH™ NB .--.KV NSN VAVAH 175 

B^ 11 2 120 ^"^^^^KDAAHAIWHTKEEINKQKQEHSQHREGGTSANDGAVAPAR 178 
BVH-11-2 120 NEIKGGYVIKVDGKYYVYLKDAAHADNIRTKEEIKRQKQEHSHNHN---SRADNAVAAAR 176
BVH-3 176 SQGRYTTNDGYVFNPADIIEDTGNAYIVPHGGHYHYIPKSDLSASELAAAKAHLAGKNMQ 235
BVH-11 179 SQGRYTTDDGYIFNASDIIEDTCDAYIVPHGDHYHYIPKNELSASELAAAEAFLSGKENL 238
BVH-11-2 177 AQGRYTTDTCYIFNASDIIEDTGDAYIVPHGDHYHYIPKNELSASELAAAEAYWNGKQ-- 234
.*»♦*** ***.*• .*«»**** .***+*, ***#«*. * 



BVH-3 236 PSQLSYSSTASDNNTQSVAKGSTSKP A-N -- - -KSENLQSLLKELYDSP 279



BVH-11 239 SNLRTYRRQNSDNTPRTNWVPSVSNPGTTNTNTSNNSNTNSQASQSNDIDSLLKQLYKLP 298 
BVH-11-2 235 -GSRPSSSSSYNANPVQPRLSENHNLTVTPTYHQN QGENI SSLLRELYAKP 284



f * * * * n 



BVH-3 280 SAQRYSESDGLVFDPAKIISRTPNGVAIPHGDHYHFIPYSKLSALEEKIARMVPISGTGS 339

BVH-11 299 LSQRHVESDGLIFDPAQITSRTARGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSN 358 



BVH-11-2 285 LSERHVESDGLIFDPAQITSRTARGVAVPHGNHYHFIPYEQMSELEKRIARIIPLRYRSM 344
***** **** * *** ♦**.»* ******* # 4



BVH-3 340 TVSTNAKPNEVVSSLGSLSSNPSSLTTSKELSSASDGYIFNPKDIVEETATAYIVRHGDH 399

BVH-11 359 HWVPDSRP-EEPSPQPTPEPSPS-PQPAPNPQPAPS- NP--IDEKLVXEAVRKVGEK3 410 

BVH-11-2 345 HWVPDSRP-EQPSPQSTPEPSPS-LQPAPNPQPAPS— -NP-IDEKLVKEAVRKVGDG 396 



•* * * •* * ** * * 



BVH-3 400 FHYIPKSNQIGQPTLPNNSLATPSPSLPINPGTSHEKHEEDGYGFDANRIIAEDESGFVM 459

BVH-11 411 YVFEE NGVSRYIP AKNLSAETAAGIDSKLAXQESLS 446 

BVH-11-2 397 YVFEE NGVSRYIP AKDLSAETAAGIDSKIAKQESLS- 432 



BVH-3 460 SHGDHNHYFFKKDLTEEQIKAAQKHLEEVKTSHNGLDSLSSHEQDYPGNAKEMKDLDKKI 519 
BVH-11 447 HKLGAKKTDLPSSDREFYNKAYDLLARIHQDLLDNKGRQVDFEALDNLLERLKDVS 502 



BVH-11-2 433 ----HKLGAKKTDLPSSDREFYNKAYDLLARIHQDLLDNKGRQVDFEVLDNLLERLKDVS 488



BVH-3 520 EEKIAGIMKQYGVKRESIVVNKEKNAIIYPHGDHHHADPIDEHKPVGIGHSHSNYELFKP 579 

BVH-11 503 SDKVKLVDDIIAFIAP--IRHPER--LGKPNAQITYTDDEIQVAKIAGKYTTEDGYIFDP 558

BVH-11-2 489 SDKVKLVDDILAFLAP--IRHPER--LGKPNAQITYTDDEIQVAKLAGKYTTEDGYIFDP 544



BVH-3 580 EEGVAKKEGNKVYTGEELTNVVNLLKNSTFNNQNFTLANGQKRVSFSFPPELEKKLGINM 639

BVH-11 559 RD-ITSDEGD-AYVTPHMTHSHWIKKDS-LSEAERAAAQAYAKEKGLTPPSTDHQD 611 

BVH-11-2 545 RD-ITSDECD-AYVTPHMTHSHWIKKDS-LSEAERAAAQAYAKEKGLTPPSTDHQD 597

• • • ** * •* • • * * . *. . **...• 

BVH-3 640 LVKLITPDGKVLEKVSGKVFGEGVGNIANFELDQPYLPGQTFKYTIASKDYPEVSYDGTF 699 

BVH-11 612 SGNTEAKGAEAIYNRVKAAKKVPLDRMPYNLQ---YTVEVKNGSL 653

BVH-11-2 598 SGNTEAKGAEAIYNRVKAAKKVPLDRMPYNLQ---YTVEVKNGSL 639
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BVH-11 654 IIP— HYPHYHNIKFEWFDBO- LYEAPKG YTLEDLLAT III 

BVH-11-2 640 HP-HYOHYHHIXFEWF^C LYEAPKG L LED ^ T 

* * * **# 



BVH-3 760 GEIKLPIPKLNQGTTRTAGNKIPVTFMAKAYLDNQSTYTVEVPILEKENQTDKPSILPQP 819

BVH-11 689 VKYYVEHPNERPHSDNGFGN ASDHVQRN— . — KNGQADTN 724

BVH-11-2 675 VKYYVEHPNERPHSDNGFCN ASDHVRKN KADQDSKP 710 

H K ^^ S ^ D ^EPKT SE KVEKEKL SE TGN S T S KSTI^EVPTVDPVQEKVAKFAES 879
BVH-11 725 QTEKPSEEKPQTEKPE EE 742
BVH-11-2 711 DEDKEHDEVSEPTHPESDEKE 731



20 Y ^^ F ^ TI ^ PSG ^*K^ra^PQGN^ 939 

BVH-11 743 TPREEKPQSE-XPES---PK 758

BVH-11-2 732 NHAGLNPSADNLYKPSTD TE 751

* ** 



BVH-3 940 PTENKPADSLPEAPNEKPVKPENSTDNGMLNPEGNVGSDPHLDPALEEAPAVDPVQEKLE 999

BVH-11 759 PTEEPEEESPEES - -EEPQVETEKVEEKLREAEDLLGK- - IQDPIIKSN -AKETLT 809 

BVH-11-2 752 ETEEEAEDTTDEA - - EI PQVENSVINAKIADAEALLEK- - VTDPS XRQN AMETLT 802 



. •* . + * 



BVH-3 1000 KFTASYGLGLDSVIFNMDGTIELRLPSGEVIKKNLSDFIA 1039

BVH-11 810 GLKNNLLFGTQ DNNTIMAEAEKLLALLKESK 840 

BVH-11-2 803 GLKSSLLLGTK DNNTISAEVDSLLALLKESQPAPIQ 838



FIGURE 32
1 ATGCAAATTA CCTACACTGA TGATGAGATT CAGGTAGCCA AGTTGGCAGG CAAGTACACA
61 ACAGAAGACG CTTATATCTT TGATACTAGT TGGATTAAAA AAGATAGTTT GTCTGAAGCT
121 GAGAGAGCGG CAGCCCAGGC TTATGCTAAA GAGAAAGGTT TGACCCCTCC TTCGACACAC
181 CACCAGGAW CAGGAAATAC TGAGGCAAAA GGAGCAGAAG CTATCTACAA CCGCGTGAAA
241 GCAGCTAAGA AGGTGCCACT TGATCGTATG CCrTACAATC TTCACTATAC TGTAGAAGTC
301 AAAAACGGTA GTTTAATCAT ACCTCATTAT GACCATTACC ATAACATCAA ATTTGACTGG
361 TTTGACGAAG GCCTTTATGA GGCACCTAAG CGGTATAGTC TTGAGGATCT TTTGGCGACT
421 GTCAAGTACT ATGTCGAACC GCGGAACGCT AGTGACCATG TTCGTAAAAA TAAGGCAGAC
481 CAAGATAGTA AACCTGATGA AGATAAGGAA CATGATGAAG TAAGTGAGCC AACTCACCCT
541 GAATCTGATG AAAAAGAGAA TCACGCTGGT TTAAATCCTT CAGCAGATAA TCTTTATAAA
601 CCAAGCACTG ATACGGAAGA GACAGAGGAA GAAGCTGAAC ATACCACAGA TGAGGCTGAA
661 ATTCCTCGTA CCCCTAGTAT TAGACAAAAT GCTATCGAGA CATTGACTGG TCTAAAAAGT
721 AGTCTTCTTC TCGGAACGAA AGATAATAAC ACTATTTCAG CAGAAGTAGA TAGTCTCTTG
781 GCTTTGTTAA AACAAAGTCA ACCGGCTCCT ATACACTAG (SEQ ID NO: 257)



FIGURE 33 



1 MQITYTDDEI QVAKLAGKYT TEDGYIFDTS WIKKDSLSEA ERAAAQAYAK EKGLTPPSTD
61 HQDSGNTEAK GAEAIYNRVK AAKKVPLDRM PYNLQYTVEV KNGSLIIPHY DHYHNIKFEW
121 FDEGLYEAPK GYSLEDLLAT VKYYVEPRNA SDHVRKNKAD QDSKPDEDKE HDEVSEPTHP
181 ESDEKENHAG LNPSADNLYK PSTDTEETEE EAEDTTDEAE IPGTPSIRQN AMETLTGLKS
241 SLLLGTKDNN TISAEVDSLL ALLKESQPAP IQ (SEQ ID NO: 258)



FIGURE 34 
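
The amino acid sequence of Figure 34 (SEQ ID NO: 258) corresponds to the nucleotide sequence of Figure 33 (SEQ ID NO: 257) read in frame from its first ATG. As a minimal sketch of that relationship, assuming the standard genetic code (the names `CODON_TABLE` and `translate` are hypothetical helpers, not part of the original document), the first 60 nucleotides of Figure 33 translate to the first 20 residues of Figure 34:

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in the figure) is ignored.
    A codon containing anything other than A, C, G or T raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 60 nucleotides of the Figure 33 sequence (SEQ ID NO: 257).
first_60 = "ATGCAAATTA CCTACACTGA TGATGAGATT CAGGTAGCCA AGTTGGCAGG CAAGTACACA"
protein = translate(first_60)
print(protein)  # MQITYTDDEIQVAKLAGKYT
```

Because unrecognised characters raise `KeyError`, the same function also serves as a quick check for character-recognition damage when applied to a longer stretch of the printed sequence.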